Language models (LMs) often struggle to pay enough attention to the input context, and generate texts that are unfaithful or contain hallucinations. To mitigate this issue, we present context-aware decoding (CAD), which follows a contrastive output distribution that amplifies the difference between the output probabilities when a model is used with and without context. Our experiments show that CAD, without additional training, significantly improves the faithfulness of different LM families, including OPT, GPT, LLaMA, and FLAN-T5 for summarization tasks (e.g., 14.3% gain for LLaMA in factuality metrics). Furthermore, CAD is particularly effective in overriding a model's prior knowledge when it contradicts the provided context, leading to substantial improvements in tasks where resolving the knowledge conflict is essential.
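The contrastive distribution described above can be written as p_CAD(y_t | c, x, y_<t) ∝ p(y_t | c, x, y_<t) · [p(y_t | c, x, y_<t) / p(y_t | x, y_<t)]^α, which in logit space reduces to (1 + α) · logits_with_context − α · logits_without_context. Below is a minimal, hedged sketch of this idea in plain PyTorch with the Hugging Face `transformers` library; it is not the authors' released implementation, and the model choice (GPT-2), function names, prompt format, and default α = 0.5 are illustrative assumptions.

```python
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer


def cad_logprobs(logits_with_ctx, logits_without_ctx, alpha=0.5):
    # Contrastive output distribution (sketch): up-weight tokens whose
    # probability increases when the context c is present.
    #   softmax[(1 + alpha) * logit(y_t | c, x, y_<t) - alpha * logit(y_t | x, y_<t)]
    return F.log_softmax((1 + alpha) * logits_with_ctx - alpha * logits_without_ctx, dim=-1)


@torch.no_grad()
def generate_with_cad(model, tokenizer, context, question, alpha=0.5, max_new_tokens=30):
    # Two parallel prompts: one conditioned on the context, one without it.
    # (The concatenation format here is an assumption, not the paper's template.)
    with_ctx = tokenizer(context + "\n" + question, return_tensors="pt").input_ids
    without_ctx = tokenizer(question, return_tensors="pt").input_ids

    generated = []
    for _ in range(max_new_tokens):
        logits_c = model(with_ctx).logits[:, -1, :]      # p(y_t | c, x, y_<t)
        logits_nc = model(without_ctx).logits[:, -1, :]  # p(y_t | x, y_<t)

        # Greedy choice under the contrastive distribution (sampling also works).
        next_id = cad_logprobs(logits_c, logits_nc, alpha).argmax(dim=-1, keepdim=True)

        # Append the chosen token to both continuations so they stay in sync.
        with_ctx = torch.cat([with_ctx, next_id], dim=-1)
        without_ctx = torch.cat([without_ctx, next_id], dim=-1)
        generated.append(next_id)
        if next_id.item() == tokenizer.eos_token_id:
            break

    return tokenizer.decode(torch.cat(generated, dim=-1)[0], skip_special_tokens=True)


if __name__ == "__main__":
    tok = AutoTokenizer.from_pretrained("gpt2")
    lm = AutoModelForCausalLM.from_pretrained("gpt2").eval()
    print(generate_with_cad(
        lm, tok,
        context="Argentina won the 2022 FIFA World Cup final against France.",
        question="Who won the 2022 FIFA World Cup?",
    ))
```

Setting α = 0 recovers ordinary decoding from p(y_t | c, x, y_<t); larger α pushes generation further toward tokens supported by the context, which is how CAD can override a model's parametric prior when it conflicts with the provided document.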